Reverse Engineering Gene Networks with Microarray Data
Abstract
We consider the question of how to solve inverse problems of the form $e^{At}x(0) = x(t)$ for an unknown matrix $A$, given measurements of $x(t)$ at different time points. Problems of this form have applications in reverse engineering gene networks. In particular, we examine the cases where $A$ is circulant and where $A$ is Toeplitz. We are also able to extend our findings for the circulant case to some generalizations of circulant matrices.

Introduction

DNA microarrays can be used to measure the concentrations of cellular gene products at a given point in time. These concentrations provide clues to the overall interaction of the genes in the gene network being studied. We can measure the perturbations $x_1, \dots, x_n$ of the gene mRNA expression concentrations from the steady state, which are governed by the equation

$$\frac{dx(t)}{dt} = Ax(t), \tag{1}$$

where $x(t) = (x_1(t), \dots, x_n(t))^T$. In other words, the rates of change of the gene product concentrations are determined by the deviations from the steady state of all the gene products present.

If $A$ is unstructured, then $A$ contains $n^2$ degrees of freedom, so we would expect that $n$ concentration measurements, each recording all $n$ gene products at one time point, would suffice to determine $A$ uniquely. However, measuring the gene product concentrations at a particular time is both time consuming and expensive. Gene networks can be on the order of 10,000 genes, so taking $10^4$ measurements is impractical. Hence, we would like to impose some structure on $A$ that allows us to take fewer gene concentration measurements and so determine $A$ more easily. This yields a model of the gene network that is easy to find and can serve as a starting point for determining the exact structure of the network.

In particular, we will first study the case where $A$ is circulant and the case where $A$ is Toeplitz. It is expected that if $A$ is circulant, then $A$ can be uniquely determined from a measurement of the $n$ gene products at a single point in time. This seems intuitive, since a circulant $A$ contains only $n$ degrees of freedom. If $A$ is Toeplitz, we expect that $A$ may be determined by measuring the $n$ gene products at two points in time, since a Toeplitz $A$ contains $2n - 1$ degrees of freedom. Once $A$ is determined, the gene product concentrations at any time $t$ are given by

$$x(t) = e^{At}x_0, \tag{2}$$

where $x_0 = x(0)$ is the vector of gene product deviations from the steady state caused by a perturbation to the system at $t = 0$.

Circulant Matrices

Circulant matrices are those square matrices $C$ of the form $C = \operatorname{circ}(c_1, c_2, \dots, c_n) =$
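To make the circulant claim above concrete, the following minimal Python sketch recovers a circulant A from the initial perturbation and a single later measurement x(t). It is an illustration under stated assumptions, not the method developed in this paper. We assume that circ(c1, ..., cn) denotes the circulant matrix whose first column is (c1, ..., cn), that the data are noise-free, that no entry of the discrete Fourier transform of x(0) vanishes, and that t is small enough for the principal branch of the complex logarithm to return the correct eigenvalues. The helper names (`circulant`, `dft`, `idft`, `expm_times`, `recover_circulant`) are hypothetical. The sketch relies on the standard fact that the discrete Fourier transform diagonalizes every circulant matrix, so in Fourier coordinates equation (2) decouples into n scalar equations.

```python
import cmath
import math


def circulant(c):
    """Circulant matrix with first column c (assumed convention): C[i][j] = c[(i - j) mod n]."""
    n = len(c)
    return [[c[(i - j) % n] for j in range(n)] for i in range(n)]


def dft(v):
    """Unnormalized discrete Fourier transform, O(n^2), fine for small n."""
    n = len(v)
    return [sum(v[j] * cmath.exp(-2j * math.pi * j * k / n) for j in range(n))
            for k in range(n)]


def idft(v):
    """Inverse of dft (includes the 1/n factor)."""
    n = len(v)
    return [sum(v[k] * cmath.exp(2j * math.pi * j * k / n) for k in range(n)) / n
            for j in range(n)]


def expm_times(A, x, t, terms=40):
    """Approximate e^{At} x with a truncated Taylor series: sum over k of (tA)^k x / k!."""
    result = list(x)
    term = list(x)
    for k in range(1, terms):
        # term_k = (t / k) * A * term_{k-1}
        term = [t / k * sum(a * b for a, b in zip(row, term)) for row in A]
        result = [r + s for r, s in zip(result, term)]
    return result


def recover_circulant(x0, xt, t):
    """Estimate the first column of a circulant A from x0 and xt = e^{At} x0.

    In Fourier coordinates, dft(xt)[k] = exp(t * lam[k]) * dft(x0)[k], where
    lam = dft(first column of A) are the eigenvalues of A.
    Assumes dft(x0) has no zero entry and |t * Im(lam[k])| < pi.
    """
    X0, Xt = dft(x0), dft(xt)
    lam = [cmath.log(b / a) / t for a, b in zip(X0, Xt)]
    return [z.real for z in idft(lam)]


# Synthetic example: simulate one measurement, then recover the matrix entries.
c_true = [-1.0, 0.3, 0.0, 0.2]   # hypothetical interaction coefficients
x0 = [1.0, 0.5, -0.2, 0.1]       # hypothetical initial perturbation
t = 0.5
xt = expm_times(circulant(c_true), x0, t)
print(recover_circulant(x0, xt, t))
```

The two assumptions in `recover_circulant` are where uniqueness can fail in this sketch: if an entry of the transform of the initial perturbation is zero, the corresponding eigenvalue leaves no trace in the measurement, and for larger t each eigenvalue is determined only up to an integer multiple of 2πi/t.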
Similar resources
Using Coarse-Grained, Discrete Systems for Data-Driven Inference of Regulatory Gene Networks: Perspectives and Limitations for Reverse Engineering
This contribution gives an initial report of a new project exploring the perspectives and limits of reversely engineering regulatory gene networks from gene expression data. The availability of such data is currently increasing dramatically due to the microarray technology. However, inferring the underlying network from expression data is difficult. We address the reverse engineering problem by...
CNET: an algorithm for Reverse Engineering of Causal Gene Networks
We present a novel Reverse Engineering algorithm, CNET, to reconstruct Gene Regulatory Networks from microarray time series data. CNET can be considered an improvement of the Mutual Information approach, present in the REVEAL [5] algorithm, with an innovative scoring function, to cope with noise, quantization errors and gene characteristic transcription delays. We tested the algorithm on simula...
Robust Reverse Engineering of Dynamic Gene Networks Under Sample Size Heterogeneity
Simultaneously reverse engineering a collection of condition-specific gene networks from gene expression microarray data to uncover dynamic mechanisms is a key challenge in systems biology. However, existing methods for this task are very sensitive to variations in the size of the microarray samples across different biological conditions (which we term sample size heterogeneity in network recon...
Genetic Network Analysis in Light of Massively Parallel Biological Data Acquisition
Complementary DNA microarray and high density oligonucleotide arrays opened the opportunity for massively parallel biological data acquisition. Application of these technologies will shift the emphasis in biological research from primary data generation to complex quantitative data analysis. Reverse engineering of time-dependent gene-expression matrices is amongst the first complex tools to be ...
Reverse Engineering of Genetic Networks Using Variable Length Genetic Algorithms with Boolean Networks
Nowadays, given the new microarray technology, a huge amount of data on gene expression is available. In order to understand the genetic expression process more completely, we need to know the control structure of the genetic expression. We present a new scheme for a genetic algorithm that shows promising results when modelling boolean networks. We can effectively and quickly obtain good approx...
A Survey on Recurrent Neural Network Based Modelling of Gene Regulatory Network
The correct inference of gene regulatory networks (GRN) remains as a fascinating task for researchers to understand the detailed process of complex biological regulations and functions. With availability of large dimensional microarray data, relationships among thousands of genes can be extracted simultaneously that is a reverse engineering problem. Among the different popular models to infer G...